Structural and Semantic Modeling of Audio for Content-Based Querying and Browsing

نویسندگان

  • Mustafa Sert
  • Buyurman Baykal
  • Adnan Yazici
چکیده

A typical content-based audio management system deals with three aspects namely audio segmentation and classification, audio analysis, and content-based retrieval of audio. In this paper, we integrate the three aspects of content-based audio management into a single framework and propose an efficient method for flexible querying and browsing of auditory data. More specifically, we utilize two robust feature sets namely MPEG-7 Audio Spectrum Flatness (ASF) and Mel Frequency Cepstral Coefficients (MFCC) as the underlying features in order to improve the content-based retrieval accuracy, since both features have some advantages for distinct types of audio (e.g., music and speech). The proposed system provides a wide range of opportunities to query and browse an audio data by content, such as querying and browsing for a chorus section, sound effects, and query-by-example. In addition, the clients can express their queries in the form of point, range, and k-nearest neighbor, which are particularly significant in the multimedia domain.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Developing a BIM-based Spatial Ontology for Semantic Querying of 3D Property Information

With the growing dominance of complex and multi-level urban structures, current cadastral systems, which are often developed based on 2D representations, are not capable of providing unambiguous spatial information about urban properties. Therefore, the concept of 3D cadastre is proposed to support 3D digital representation of land and properties and facilitate the communication of legal owners...

متن کامل

Hierarchical Semantic Content Analysis and Its Applications in Multimedia Summarization and Browsing

Television (TV) services have been becoming increasingly important in our daily life. With the advent of communication technologies, a large variety of TV content is delivered to customers through various channels, and an important challenge for content providers is to efficiently manage such abundant content. In this chapter, we first review key techniques in semantic multimedia (audio-visual ...

متن کامل

Beyond the Query-By-Example Paradigm: New Query Interfaces for Music Information Retrieval

The majority of existing work in music information retrieval for audio signals has followed the content-based query-by-example paradigm. In this paradigm a musical piece is used as a query and the result is a list of other musical pieces ranked by their content similarity. In this paper we describe algorithms and graphical user interfaces that enable novel alternative ways for querying and brow...

متن کامل

Integrating Structure and Semantics into Audio-visual Documents

Describing audio-visual documents amounts to consider documentary aspects (the structure) as well as conceptual aspects (the content). In this paper, we propose an architecture which describes formally the content of the videos and which constrains the structure of their descriptions. This work is based on languages and technologies underlying the Semantic Web and in particular ontologies. Ther...

متن کامل

ANNODEX-ing Broadcast TV News for Semantic Browsing and Retrieval

The development of a semantic web of text, images, structured information, and continuous media information including video and audio, depends on being able to annotate content with semantic meaning and then use that content description directly in applications. ANNODEX is a recently announced initiative to allow continuous media files to be integrated directly with their own content descriptio...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006